-
Notifications
You must be signed in to change notification settings - Fork 161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor CI matrix. #1844
Refactor CI matrix. #1844
Conversation
Oh I should've mentioned, the images without the OS in the name are 24.08: - rapidsai/devcontainers:24.06-cpp-gcc13-cuda12.4
+ rapidsai/devcontainers:24.08-cpp-gcc13-cuda12.4 |
cd5ceec
to
0ac59a8
Compare
We no longer need this to lookup images.
0ac59a8
to
a0b813a
Compare
versions: | ||
14.16: { stds: [ 14, ], aka: '2017' } | ||
14.29: { stds: [ 14, 17, ], aka: '2019' } | ||
14.36: { stds: [ 14, 17, 20] } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure 14.36 exists in the devcontainers. I'll go poke around. We'll need to update to 14.40 soon.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's only used for the CTK12.0 images that cudax will need once we enable those builds. Not sure if it was intentionally used there or if it just missed an upgrade.
v100: { sm: 70 } # 32 GB, 40 runners | ||
t4: { sm: 75, testing: true } # 16 GB, 8 runners | ||
rtx2080: { sm: 75, testing: true } # 8 GB, 8 runners | ||
rtxa6000: { sm: 86, testing: true } # 48 GB, 12 runners | ||
l4: { sm: 89, testing: true } # 24 GB, 48 runners | ||
rtx4090: { sm: 89, testing: true } # 24 GB, 10 runners | ||
h100: { sm: 90 } # 80 GB, 16 runners |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
minor suggestion: Instead of trying to abstract that fact that some GPUs have different runner labels, what do you think about just making runner_label:
an explicit property of defining a GPU? I can easily foresee new runner labels getting added/changed with new structures that would break the existing logic.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's a good point, we may need something more flexible here at some point. We can easily deal with that when/if it becomes an issue, but I'd rather not pre-emptively change this without a usecase.
New version of gcc struggles with replace.cu.
25479c9
to
c9d3102
Compare
🟩 CI finished in 6h 39m: Pass: 100%/365 | Total: 1d 19h | Avg: 7m 05s | Max: 1h 00m | Hits: 96%/521798
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
CUB | |
+/- | Thrust |
CUDA Experimental |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
🏃 Runner counts (total jobs: 365)
# | Runner |
---|---|
264 | linux-amd64-cpu16 |
56 | linux-amd64-gpu-v100-latest-1 |
24 | linux-arm64-cpu16 |
21 | windows-amd64-cpu16 |
🟩 CI finished in 11h 48m: Pass: 100%/420 | Total: 3d 16h | Avg: 12m 37s | Max: 1h 00m | Hits: 75%/523210
|
Project | |
---|---|
+/- | CCCL Infrastructure |
libcu++ | |
CUB | |
+/- | Thrust |
CUDA Experimental |
Modifications in project or dependencies?
Project | |
---|---|
+/- | CCCL Infrastructure |
+/- | libcu++ |
+/- | CUB |
+/- | Thrust |
+/- | CUDA Experimental |
🏃 Runner counts (total jobs: 420)
# | Runner |
---|---|
305 | linux-amd64-cpu16 |
64 | linux-amd64-gpu-v100-latest-1 |
28 | linux-arm64-cpu16 |
23 | windows-amd64-cpu16 |
Simplifies CI matrix.yaml file to close #1758.